Stereoscopic image processing methods and apparatus

ABSTRACT

Stereoscopic image processing methods and apparatus are described. Left and right eye images of a stereoscopic frame are examined to determine if one or more difference reduction operations designed to reduce the luminance and/or chrominance differences between the left and right frames is within a range used to trigger a difference reduction operation. A difference reduction operation may involve assigning portions of the left and right frames to different depth regions and/or other region categories. A decision on whether or not to perform a difference reduction operation is then performed on a per regions basis with at the difference between the left and right eye portions of at least one region being reduced when a difference reduction operation is to be performed. The difference reduction process may, and in some embodiments is, performed in a precoder which processes left and right eye images of stereoscopic frames prior to stereoscopic encoding.

RELATED APPLICATIONS

The present application is a continuation of U.S. patent applicationSer. No. 13/744,400 filed on Jan. 17, 2013 which claims the benefit ofU. S. Provisional Patent Application Ser. No. 61/587,642 filed Jan. 17,2012 each of which is hereby expressly incorporated by reference in itsentirety.

FIELD

The present invention is related to the field of stereoscopic imageryand more particularly, to methods and apparatus for encodingstereoscopic image data, e.g., image data representing one or morestereoscopic frames, e.g., a sequence of stereoscopic frames.

BACKGROUND

Stereoscopic images, e.g., 3D images, are growing in popularity. In thecase of stereoscopic images each image normally corresponds to a pair offrames, e.g., a left eye image and a right eye image, captured fromcameras spaced apart by a distance approximating the distance between ahuman's left and right eye. During playback, the left eye image issupplied to the left eye and the right eye image is supplied to theright eye. Minor differences between the left and right eye areperceived and give a sense of depth.

Reflections off objects and/or other camera issues may result inundesired differences in the left and right eye images beyond thedifferences which allow for the perception of depth. Such undesirabledifferences may have negative effects on the perceived image quality ofa stereoscopic image which is played back and viewed.

While some undesirable differences between left and right eye images maybe due to issues relating to image capture, transmission and storageconstraints may also have an impact on perceived image quality. Toreduce the amount of data required to store and/or communicate images,stereoscopic images are often encoded, e.g., compressed, using lossycompression techniques. Some encoding techniques involve encoding thecontent of one image in a stereoscopic image pair representing astereoscopic frame as a function of the other image in the stereoscopicframe. Large differences between the left and right eye images of animage pair representing a stereoscopic frame may render such compressiontechniques less effective than is desirable and/or cause undesirablecoding artifacts which may be visible when viewing a stereoscopic imagegenerated during playback from the encoded left and right eye imagescorresponding to a stereoscopic frame.

In view of the above discussion, it should be appreciated that it wouldbe desirable if one or more methods and/or apparatus could be developedwhich could reduce or eliminate the negative effects of undesirable orexcessive differences between left and right eye images of astereoscopic frame. While methods and apparatus which could beincorporated into an encoder would be desirable it would be particularlydesirable if methods of processing stereoscopic images to reduceundesirable differences could be developed which were generally encoderindependent, e.g., which could be used with any of a wide variety ofstereoscopic encoding techniques.

SUMMARY

Methods and apparatus for processing stereoscopic image data, e.g.,luminance values (Y), first chrominance (Cr) values and secondchrominance (Cb) values, corresponding to left and right eye images of astereoscopic frame, are described. The luminance image values may, andoften are, provided at a higher resolution, e.g., more values per agiven portion of an image, than the chrominance values. The methods andapparatus of the invention do not depend on the resolution of theluminance and/or chrominance values and the described techniques can beused whether the luminance and chrominance values are provided at thesame or different resolutions.

While the methods are described in terms of sets of Y, Cr, and Cb imagevalues the methods of the invention are equally well suited for systemswhich represent stereoscopic frames using sets of R, G, and B (Red,Green, Blue) values.

The image data corresponding to multiple sequential frames, e.g.,corresponding to a video sequence of a movie, television program orother video sequence, may be processed, e.g., sequentially, prior toencoding and/or as part of an encoding process in accordance with one ormore embodiments described herein.

The processing of a stereoscopic frame, e.g., a pair of left and righteye images, will now be described with the understanding that to processvideo sequences the same or similar processing would be performed tomultiple stereoscopic frames in the video sequence.

In accordance with one feature, a frame is first checked to determine ifit would benefit from a reduction in the level of differences betweenthe left and right eye images in terms of first, second and/or thirdimage values, e.g., luminance values, first chrominance values and/orsecond chrominance values. The determination for each one of thedifferent types of values can be made independently and in parallel. Insome embodiments one of the three sets of values is processed while inother embodiments more than one set, e.g., all three sets of values areprocessed and subjected to a possible disparity, e.g., difference,reduction operation to reduce the difference between the values of agiven type between the left and right eye images. Since a human's visualsystem tends to be more sensitive to luminance than chrominance, inembodiments where the disparity check and possible reduction operationis performed on a single type of image values, the single type of imagevalues is normally the luminance values rather than one of the sets ofchrominance values.

To determine if a frame will benefit from a disparity reductionoperation an overall disparity value is generated for the type of dataunder consideration, e.g., luminance data for purposes of explanation.In the case of luminance data, the overall disparity value is anindication of the overall luminance disparity between the left and righteye images of the frame being processed. If the overall disparity valuefalls within a range used to determine that the frame will benefit froma disparity reduction operation, a disparity reduction operation will beperformed on the frame.

In some embodiments determining if the disparity reduction operationwill be beneficial includes comparing the overall disparity value forthe frame to a first, lower threshold and to a second, higher threshold.In some embodiments the lower threshold is 20% of the maximum possibledisparity (e.g., maximum possible luminance difference) between the lefteye image and the right eye image. In at least one such embodiment thesecond upper threshold is 70% of the maximum possible disparity.Disparity's outside this range are indicative of significant differencesbetween the left and right eye images, e.g., they are different imagespossibly because of an image capture error or some other reason, andattempting to reduce the disparity may result in ghosting due to anobject of one image being copied into the other in an attempt to reducethe disparity. Accordingly, to avoid possibly introducing changes thatmight worsen the end image result when the overall disparity value isoutside the range identified for purposes triggering a disparityreduction operation, the disparity reduction operation will be skipped.

However, when the overall disparity value for a type of image data,e.g., luminance data, is within the range used to trigger a disparityreduction operation, a disparity reduction operation will be preformedto reduce the difference between the left and right eye images in termsof the type of data with the overall disparity value within the rangeused to trigger the disparity reduction operation.

As part of a disparity reduction operation, image values of one and/orboth images of a stereoscopic frame will be altered to reduce thedisparity that was detected between the left and right eye images.

Decisions on whether to perform a disparity reduction operation on aframe is performed independent for each of the types of image data beingprocessed. Thus, a decision to perform a luminance disparity reductionoperation does not mean that a chrominance disparity reduction will alsobe performed. However, a chrominance disparity reduction may beperformed depending on the overall chrominance disparity values (ODVCr,ODVCb) for the frame being processed. Similar to luminance, in someembodiments the overall chrominance disparity, e.g., (ODVCr) isevaluated to determine within it is within the range set by a lowerthreshold of 20% of maximum chrominance disparity possible (e.g., 20% ofmaximum chrominance Cr disparity possible) and an upper threshold of 70%of maximum chrominance disparity possible (e.g., 70% of maximumchrominance Cr disparity possible) and if it is within the range than achrominance disparity reduction may be performed (e.g., on thechrominance Cr).

In some embodiments the area of a frame, and thus the area of the leftand right eye images which correspond to the frame, is treated as asingle region for disparity reduction purposes. However, in moresophisticated embodiments portions of the left eye image and portions ofthe right eye image are independently mapped to various regions whichare supported. In some embodiments the different regions which aresupported correspond to different depths, e.g., different distances fromthe camera to the objects in the image portion being mapped to a region.For example, in one embodiment a foreground region, a middle region anda background region are supported. The mapping of image portions toregions involves in one embodiment generating an image depth map for theleft eye image and an image depth map for the right eye image with eachdifferent depth of the depth map corresponding to a different region,e.g., foreground, middle or background. A variety of know stereoscopicdepth map generation techniques may be used to generate the depth mapused to assign image portions to regions. Each region of an image, e.g.,left eye image or right eye image, may include a plurality ofnon-contiguous portions of the image assigned to the region, e.g.,foreground. The left and right eye images may have different numbers ofpixel values assigned to different regions but with the total number ofpixels in each of the left and right eye images normally being the same.

Performing a difference reduction operation on a frame in someembodiments is performed on a per region basis. In one embodiment thedecision whether or not to perform a difference reduction operation on aparticular region of the left and right eye images is made in the sameor similar manner as the processes of deciding whether a differencereduction operation is to be performed on a frame but with the image,e.g., pixel, values of the region of the left and right eye images beingprocessed to generate a disparity value for the region as opposed to theentire frame. Since a region, e.g., the foreground region, in the lefteye image may include a different number of pixel values than the sameregion, e.g., foreground region, in the right eye image the sets ofimage values, e.g., luminance values, are scaled so that the sets ofvalues can be compared as if they each included the same number of pixelvalues. If the number of pixels in the region in the left eye and theright eye are the same, there is no need for the scaling operation, alsosometimes referred to as a normalization operation. For example, thereis no need for scaling in the case of generating an overall disparityvalue for a frame when the left and right eye images include the samenumber of pixel values.

In one embodiment, after scaling, if needed, the pixel values of a lefteye region are processed to generate a left eye histogram and the righteye region pixel values are processed to generate a histogram for thesame region of the right eye. The histogram, in some embodiments, is aset of counts where each count is a count of the number of pixels in theregion having the pixel value to which the count corresponds. Forexample if pixel values can range from 0 to 255, there would be 256counts of pixel values where the count corresponding to a pixel value issometimes stored in a memory location sometimes referred to as a “bin”.The count for any one pixel value is in the range of 0 to the totalnumber of pixels in the region. In some embodiments, scaling isperformed after the histograms have been generated.

The left and right eye image values are compared to generate a set ofdifference values representing the difference between the counts in thebins of the left eye image and the counts in the bins of the right eyeimage. Thus, in some embodiments if there are 256 pixel values 256difference values are generated with each of the 256 difference valuescorresponding to a different pixel value. In one embodiment thedifference value for the region is generated as a weighted sum of thedifference values. For example, in one particular embodiment thedifference value of each bin is multiplied by the pixel valuecorresponding to the bin with the results of this weighting being summedto form the overall pixel value difference. This weighting takes intoconsideration that the bins correspond to different intensities. Forexample a luminance pixel value of 255 representing a fully on pixelwhich will contribute approximately 255 times as much to the overallluminosity of a region than a 0 luminance intensity pixel value.

The overall disparity value generated for a region is checked to see ifit falls within the range used to trigger a disparity reductionoperation. The range may be the same as that described with regarding tomaking a decision on whether or not to perform a disparity reductionoperation on the frame however a different range may be used.

If the disparity reduction value generated for the region falls withinthe range used to trigger a disparity reduction operation on the regiona disparity reduction operation will be performed on the region. Such anoperation will result in the changing of one or more pixel, i.e., image,values in the left and/or right eye image portions which were determinedto correspond to the region being processed. The difference (disparity)reduction operation modifies image values to reduce the differencebetween the histograms, e.g., scaled histograms if scaling wasperformed, for the left and right eye image portions corresponding tothe region subject to the difference reduction operation. In someembodiments the difference reduction operation is performed using ahistogram matching operation. In other embodiments the differencereduction operation is performed using an earth mover's distanceoperation. Both the histogram matching operation and the earth mover'sdistance operation are know operations which can be used to determinehow to alter image values to reduce the difference between histograms ofimage values corresponding to different images.

While a difference operation will be performed on at least one region ofa frame in which a decision was made to perform a difference reductionoperation, not all regions may be subject to a difference reductionoperation. For example, when region based difference reduction decisionsare made, a decision to perform a difference reduction operation mayresult in a difference reduction operation being performed on backgroundimage areas but not foreground image areas depending on the region baseddecisions which are made.

A processed set of frame data is generated from the received image dataand the processing performed as part of a difference reductionoperation. Image values which are altered as part of a differencereduction operation are included in the processed frame in place of theoriginal input values. However, image values which are not altered bythe processing will be included in the processed frame withoutalteration. Thus, despite processing to reduce differences between leftand right eye images the number of luminance and/or chrominance valuesin the input image and processed output image generated by theprocessing which is performed will both normally include the same numberof image values assuming the processing does not alter the size or shapeof the frame.

The processed frame including left and right eye images generated by thedescribed processing is normally subject to encoding and then storage,transmission and/or decoding and display. The stereoscopic imagesnormally, and often do, correspond to real world scenes, e.g., sportingevents, animal observations, parts of a movie being captured for moviegeneration purposes, etc.

The processing performed prior to encoding or as part of an encodingprocess can enhance compression, particularly in cases where differenceencoding techniques are used, and/or reduce annoying differences betweenleft and right eye images that may be perceived by a viewer and which donot contribute to the desired 3D effect in a significant and/ormeaningful way.

While the same or similar types of thresholds may be used for makingdecisions whether to perform luminance and chrominance differencereduction operations, the maximum permitted chrominance differencereductions may be limited to being less than the permitted amount ofluminance reduction to avoid introducing undesirable color distortions.For example changes in chrominance to reduce a chrominance differencemay be limited to 20% or less than the maximum possible chrominancedifference between left and right eye images while changes in luminanceas part of a difference reduction operation may go as high as 50% of themaximum possible luminance difference between left and right eye images.

In some embodiments the processing performed in accordance with theinvention is implemented in a precoder device, e.g., by a processorconfigured to implement the method of the invention, with the precoderbeing followed by an encoder as part of a stereoscopic encoding anddistribution system. The precoder can be used with a wide variety ofstereoscopic encoders.

In other embodiments the precoder functionality is incorporated directlyinto an encoder.

While thresholds used in accordance with the invention may bepreconfigured and set, in other embodiments they are user configurableand can be altered by a user based on observed results, e.g., thequality of processed, encoded and then decoded and displayedstereoscopic images produced using the precoder of the invention.Different thresholds may be set for luminance and/or chrominance and theamount of luminance and/or chrominance different reduction to beperformed as well as difference reduction information indicating theachieved disparity reduction on frames of a sequence. These thresholdsand difference reduction information may be stored and made available toa user controlling an encoder and/or the precoder of the invention.

While various exemplary embodiments and features have been described,numerous additional features and embodiments are described in thedetailed description which follows.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1A which is a first part of FIG. 1 illustrates a first portion ofan exemplary method for processing image data in accordance with oneembodiment of the present invention.

FIG. 1B which is a second part of FIG. 1 illustrates a second portion ofthe exemplary method for processing image data in accordance with theembodiment of the present invention shown in FIG. 1A.

FIG. 1C which is a third part of FIG. 1 illustrates a third portion ofthe exemplary method for processing image data in accordance with theembodiment of the present invention shown in FIGS. 1A and 1B.

FIG. 2 illustrates an exemplary subroutine for generating a disparityvalue in accordance with one embodiment of the present invention.

FIG. 3 illustrates an exemplary subroutine for determining if adisparity value is in a difference range used to trigger a differencereduction operation in accordance with one embodiment of the presentinvention.

FIG. 4 illustrates an exemplary difference reduction operationsubroutine in accordance with one embodiment of the present invention.

FIG. 5 illustrates an exemplary apparatus in accordance with oneembodiment of the present invention.

FIG. 6 illustrates exemplary left eye image luma histogram, right eyeimage luma histogram and an exemplary normalized histogram weightedtowards luminance reduction, e.g., exposure reduction, generated fromsaid left and right eye image luma histograms.

FIG. 7 illustrates exemplary left eye image luma histogram operation,right eye image luma histogram operation, an exemplary left eye lumahistogram output, and an exemplary right eye luma histogram output.

FIG. 8 illustrates an exemplary system that implements an embodiment ofthe present invention.

FIG. 9 illustrates an assembly of modules which may be used in theapparatus of FIG. 5 or the precoder of FIG. 8.

DETAILED DESCRIPTION

Differential stereoscopic encoders are sensitive to input quality,especially to disparity between eye views. This can reduce the encoder'srobustness when input quality drops. Without mitigation, poor inputquality could reduce live encoder efficiency. In some instances, theencoders could amplify differences in input content.

An exemplary process in accordance with the invention is designed toincrease transmitted stereoscopic quality relative to other methods.This helps avoid lowered performance or the introduction of artifactsthat may damage stereoscopic quality leading to unacceptableconsequences of low encoded image quality in the encoded image andultimately the 3D image generated by decoding the encoded image(s).

Various features discussed herein can be used to enhance a real-timestereoscopic encoder to maintain encoded stream quality when the qualityof input stereoscopic material is poor, e.g., includes undesirabledifferences between left and right eye images corresponding to the sameframe.

The methods and apparatus of the present invention are particularlyuseful when encoding live images where the amount of processing and/orthe time available at an encoder for encoding stereoscopic images islimited. By reducing the difference between left and right eye views,the encoding process in many encoders can be simplified since there willbe a greater match between left and right eye views allowing for simplerinter-image encoding, e.g., difference encoding and/or prediction basedencoding, then would be possible with larger differences between theleft and right eye images.

Live content often has defects, such as noise, sync drift, camerageometry disparity, lens flares, polarization disparity frombeamsplitter mirrors, chromatic disparity, exposure disparity andswitching artifacts, which may go uncorrected through the broadcastpipeline. The precoder of the present invention addresses and/or reducesat least some of the defects that may be created in a stereoscopic pairof images as the result of such devices.

Often, live camera outputs have severe differences in luminance causedby one camera in the beamsplitter rig recording glare, as a result ofthe polarizing effect of the beamsplitter mirror. In such cases, theresultant stereoscopic content, absent processing such as by a precoderof the present invention, is often uncomfortable to view because of theluminance disparity between the left and right eye images.Unfortunately, when the content is passed through a live encoder withoutmitigation, the problem can be compounded. The encoder often strugglesto correct the disparity by iterating its feature-matching algorithmuntil it must abandon the effort and move to the next frame. With so fewmatches found between the features of the left and right eye input, theencoded result has lowered redundancy elimination than might be possibleif more time and/or resources were available for encoding. This problemcould be reduced by allowing the encoder more time to iterate thefeature-matching process. Unfortunately, however, more time is notpossible in a live encoding scenario and/or many other applicationswhere time and/or processing resources are limited for cost or otherreasons.

In addition to the diminished ability to compress the contenteffectively, destructive visual patterns can form in the encoded streamthat correspond to the frequency of the alternating luminosity levels inthe content. These patterns, in some cases, amplify the original defectin the content.

Various features described herein result in an enhancement and/orimprovement to live stereoscopic (3d) encoders and/or other types ofencoders. The difference reduction techniques are robust enough toprovide improvements to periodic input quality fluctuations but alsoprovide improvements in the case of infrequent or sudden differencesbetween left and right eye images of a stereoscopic frame. Theenhancement provided by the precoder of the invention can reduce and/oreliminate the destructive artifacts due to differences between left andright eye input images, without the need to increase encoder latency.

In some embodiments of the present invention the left and right eyeimage data is processed to reduce differences between the left and righteye images therein providing, in many cases, the advantage of increasingencoding efficiency when encoding the left and right eye images. This isparticularly the case when using a differential encoder and/or anotherencoder which encodes one of the left and right eye images based on thecontent of the other one of the left and right eye images. In additionto providing improved encoding efficiency in at least some embodimentsthe difference reduction techniques improve perceived decoded imagequality by reducing differences in the left and right eye images whichmay be due to reflections and/or saturation of the input sensor of oneof the eye images and/or other image capture related problems.

Various features are directed to Stereoscopic Aware SpectralEqualization (SASE) of images as part of a stereoscopic image encodingprocess.

FIG. 1 illustrates image processing in accordance with the inventionthat can, and in some embodiments is, performed by the system shown inFIG. 5.

FIG. 1 illustrates a method which can, and in some embodiments is, usedto perform stereoscopic image equalization, e.g., as part of astereoscopic image precoding operation or as part of a stereoscopicimage encoding operation. The method reduces and/or avoids imbalances,e.g., luminance and/or chrominance imbalances between the left and righteye images of an image pair representing a stereoscopic frame. Themethod can reduce encoding artifacts that might result if the left andright eye images of a stereoscopic image are encoded without suchprocessing and/or may increase encoder efficiency and/or reduce theprocessing time required to encode the input image data.

For purposes of explaining the invention it will be assumed that theprecoder 500 of FIG. 5 implements the method. However, the method mayand in some embodiments is, implemented by the precoder 804 shown inFIG. 8.

The method 100 shown in FIG. 1 begins in step 102 with the methodbeginning to be executed by processor 510 of the precoder 500. Operationproceeds from start step 102 to receive step 104 in which the precoder500 receives the image data to be processed, e.g., first, second andthird sets of image values corresponding to a left eye image and a righteye image. The first, second and third set of image values may include,e.g., luminance (Y) pixel values, first chrominance (C_(R)) values, andsecond chrominance (C_(B)) values respectively. In cases where only oneset of values, e.g., luminance values, are to be processed the sets ofchrominance values would not be processed by the precoder and ifreceived would simply be passed through and/or incorporated into the setof processed frame data without alteration. As should be appreciated thesets of chrominance data while corresponding to the same frame as theluminance data may, but need not be, at a lower resolution than theluminance data. Accordingly, at least in some embodiments each of thefirst and second sets of chrominance data, for the left and right eyeimages of a frame, include fewer image values than each set of luminancedata for the left and right eye images of the same frame. The resolutionof the input data is not critical to the invention.

Operation proceeds from step 104 along three paths which may beimplemented in parallel or sequentially with each path corresponding toone of the three different sets of input data, e.g., luminance Y,chrominance C_(R) and chrominance C_(B). The processing performed oneach of the three different paths is similar.

The luminance processing path from step 104 begins in step 106 in whicha first overall disparity value for luminance (ODVY_(DIFF)) is generatedfrom the luminance data corresponding to the left and right eye imagevalues corresponding to the stereoscopic frame being processed. Step 106may, and normally does, involve a call to a disparity value generationsubroutine such as the disparity value generation subroutine 200 shownin FIG. 2.

The disparity value generation subroutine 200 will now be discussedbriefly to facilitate understanding of the overall method. The disparityvalue generation subroutine 200 is started, e.g., in response to a callfrom the main routine 100 which triggers execution of the subroutine 200e.g., by a precoder or hardware in the precoder 500. In step 204, theimage values 203 for the left and right eye image portions which are tobe used in generating the disparity value are received. In the casewhere the image portions which are to be used is the entire frame, e.g.,to generate an overall disparity value for the frame, the full set ofleft and right eye image values for the type of data, e.g., luminancedata, which is being processed will be received. With the image valuesfor the left and right eye having been received, operation proceeds tostep 206 in which a first histogram is generated from the left eye imagevalues being processed, e.g., the values for the left eye image portionsreceived in step 204. The histogram is a set of counts where each countcorresponds to a different possible pixel value. The count for a pixelvalue indicates the number of pixels in the image portion having thepixel value. For example, a count of 10 for the pixel value 254 wouldindicate that there are 10 pixel values with the value 254 in the lefteye image portion being processed. Each count may, and in someembodiments is, stored in a memory location referred to sometimes as a“bin” or “bucket”. FIG. 6 is an illustration 600 which include a diagram602 that shows an exemplary left eye histogram where axis 608 showspossible pixel values and the vertical axis of diagram 602 correspondsto numbers of pixels. In the FIG. 6 example, the total number pixelvalues in the set of luminance data for each of the left and right eyeimages is 921,600. The horizontal axis has a maximum pixel count of42,000 with the horizontal lines being used to provide an indication ofvarious pixel count level with the highest line corresponding to a42,000 pixel count level. While the numbers shown on the horizontalscale range from 1 to 254 the actual range is from 0 to 255 with thetext indicating 0 and 255 being omitted for space reasons.

From step 206 operation proceeds to step 208 in which a first histogramof right eye image values is generated for the portion of the right eyeimage being processed. Diagram 604 of FIG. 6 shows an exemplary righteye histogram which is similar but not identical in shape to the lefteye image histogram 602 for the frame to which the FIG. 6 examplecorresponds. While steps 206 and 208 are shown as sequential steps theycan be performed in parallel.

In the FIG. 6 example it can be seen that the left eye image of theexemplary frame includes relatively few pixels having the values 1 and254 but a large number of pixels having the value 144.

In addition to the left and right eye histograms 602 and 604. FIG. 6shows a calculated normalized histogram 606 with pixel values rangingfrom 0-255 along the horizontal axis 612. Histogram 606 is shown forpurposes of explaining the invention and, at least in some embodiments,need not be calculated as part of the method. The histogram 606represents the histogram that would result if a single histogram, weregenerated from the full set of left and right eye luminance values andthe total number of pixels in the histogram was the same as that for asingle eye view histogram. Since the total number of pixels would betwice that of the individual histograms, the counts generated byconsidering all the pixel values in the left and right eye images isnormalized to generate histogram 606 by scaling the counts in thehistogram by a factor of 0.5 to make it easier to compare the histogram606 to histograms 608, 610. As should be expected the histogram 606 hasa shape somewhat between that of the left and right eye histograms 608,610. To reduce the difference between the left and right eye histograms,it may be useful to alter pixel values in at least one and/or both ofthe left and right eye images to make the histograms of the modifiedleft and right eye images more closely match. In the case wheremodifications are made to both the left and right eye image values, eachof image values for the left and right eyes are changed, in someembodiments, so that the distribution of values more closely approachesthat of the histogram 606.

Referring once again to FIG. 2, operation proceeds to step 210 where acheck is made to determine if the number of pixels in the firsthistogram of left eye image values is different than the number ofpixels in the first histogram of right eye image values. In the casewhere the portion being processed corresponds to the entire frame theleft and right eye image portions will normally include the same numberof pixel values. However, when a portion of the image being processedhas been identified based on a depth map or another technique, thenumber of pixels in the image region being processed may be differentfor the left and right eye portions. For example one of the left andright eye images may include a larger foreground image portion than theother one resulting in the number of image values being different forthe foreground region when the number in the left eye is compared to thenumber in the right eye.

If the number of image values in the left and right eye image portionsused to generate the histograms in step 206 and 208 are the same,operation will proceed from step 210 to step 214. However, if the numberof image values in the histograms is different operation will proceedfrom step 210 to scaling step 212. In scaling step 212 one of the firsthistogram of the left eye image values or the first histogram of theright eye image values is scaled so that the resulting histograms arebased on the same number of pixels. For example if the first left eyehistogram is based on 1000 pixels and the right eye histogram is basedon 500 pixels the right eye histogram is scaled in some embodiments by 2so that it is based on 1000 pixels. Alternatively the counts in thefirst left eye histogram could be scaled, e.g., multiplied by ½, so thatthe histograms to be compared are based on a count of 500 image values.

Operation proceeds from step 212, when used, to step 214. Thus, in step214 the histograms being used in step 214 are normalized histograms,e.g., left and right eye histograms based on histograms including thesame number of image values. In step 214 a difference histogram isgenerated. Generation of a difference histogram may include subtracting,on a per bin basis, pixel value counts corresponding to the right eyeimage from pixel value counts corresponding to the left eye image. Thisdifference determination process generates a difference value for eachbin, e.g., possible pixel value. The set of difference values representsthe difference histogram for the normalized left and right eye imagehistograms for the region of the image being processed, e.g., entireframe, foreground region, middle ground region or background region.

Operation proceeds from difference histogram generation step 214 todifference value generation step 216. In step 216 a weighted sum ofdifference values is generated by weighting the individual differencevalues included in the difference histogram generated in step 214 andsumming the weighted values to generate a disparity value. In someembodiments the weighting is a simple weighting based on the image valuebin to which the difference value corresponds. Assume for example thatthe bin corresponding to pixel value 5 shows a difference of 20 pixelsbetween the left and right eye images while the bin corresponding topixel value 254 shows a difference of 1 pixel between the left and righteye images. The weighted sum would include the following contribution of100 (20×5) from bin 5 and a contribution of 254 (1×254) from bin 254.Contributions from the other bins of the difference histogram would besummed with these contributions to generate the overall disparity valuein one embodiment. Note that such a weighting approach reflects the factthat the pixel value corresponding to bin 254 will contribute much moresignificantly to the overall luminance or chrominance of the region,depending on which type of images are being processed, than the pixelvalue corresponding to bin 5.

After generation of the weighted sum of difference values in step 216,the generated weighted sum of difference values is returned in step 218as the disparity value generated by the step which called the subroutine200. For example, if subroutine 200 was called from step 106 thereturned disparity value would be the overall disparity value(ODVY_(DIFF)). If subroutine routine 200 was called by step 116 anoverall chrominance disparity value would be returned. Subroutine 200may be called when processing regions of the frame which do not includethe entire frame, e.g., a foreground region. In such a case the returneddisparity value would be indicative of the difference between the leftand right eye foreground regions with the type of data to which thedisparity value applied depending on whether it was generated fromluminance data or one of the sets of chrominance data.

Operation proceeds from step 106 to step 108 in which the disparityvalue generated in step 106 is checked to determine if it is in a firstdifference range used to control performing of a first differencereduction operation, e.g., a luminance difference reduction operationused to reduce the difference between the left and right eye imageluminance histograms. Step 108 may, and in some embodiments is,implemented by a call to subroutine 300 shown in FIG. 3. The range may,in the case of luminance image values, have a lower threshold of 20% ofthe maximum possible difference between the overall luminance of theleft and right eye images and an upper threshold of 70% of the maximumpossible luminance difference between the left and right eye images.However other thresholds are possible. Values outside this range mayindicate some form of error and trying to make the left and right eyeimages similar may create more artifacts and/or degrade image quality.The indicated ranges are exemplary and a user controlling the encodingprocess can alter the range by changing one or more thresholds, e.g.,based on the observed quality of images produced by encoding and thendecoding and displaying the processed images.

If in detection step 110 the first overall disparity value is in thefirst difference range, a disparity reduction operation is triggered byproceeding from detection step 110 to step 112 wherein a firstdifference reduction operation is performed to generate an updated setof first values for the left and right eye images. The updated sets ofleft and right eye image values may include one or more luminance valueswhich have been altered to reduce the difference between the sets ofleft and right eye image luminance values. Step 112 may be implementedby a call to a difference reduction subroutine such as that shown inFIG. 4 and which will be described below.

From step 112 operation proceeds via connecting node 130 to step 132 ofFIG. 1C. Operation proceeds directly from step 110 via connecting node130 to step 132 if in step 110 it is determined that the first overalldisparity value is not in the range which triggers performing adisparity reduction operation.

Steps 106, 108, 110 and 112 correspond to the processing pathcorresponding to first image values, e.g., luminance image values. Steps114, 116, 118 and 120 which correspond to the processing of the secondset of image values, e.g., left and right eye image chrominance values,involve operations which are similar to those previously described withregard to luminance processing in step 106, 108, 110 and 112,respectively.

In step 114 a second overall disparity value (ODVCR_(Diff)) is generatedfrom the Cr (red chrominance) values corresponding to the left and righteye images of the frame being processed. This may be done by making acall to the disparity value generation subroutine 200 and supplying theCr values for the frame being processed.

Operation proceeds from step 114 to step 116 in which a determination ismade as to whether or not the second overall disparity value generatedin step 114 is within a second difference range used to triggerperforming a chrominance disparity reduction operation. The lower andupper limits of the range may, as in the case of the luminanceprocessing be, e.g., 20% and 70% but with the percentages being inregard to the maximum possible (Cr) chrominance difference between theleft and right eye images of a frame being processed.

In step 116 the second overall disparity value is checked to determineif it is in the range that causes image processing to proceed todifference reduction step 120. This check may be performed by making acall to subroutine 300 shown in FIG. 3. Operation then proceeds todecision step 118. If it is determined that the second overall disparityvalue is in the range which triggers a second difference reductionoperation, operation proceeds from step 118 to step 120. In step 130 adifference reduction operation is performed, e.g., by a call todifference reduction subroutine 400. However, if in step 118 the secondoverall disparity value is not in the second difference range used totrigger a disparity reduction operation, operation proceeds from step118 to step 132 shown in FIG. 1C via connecting node C 130. In step 120a difference reduction operation is performed, e.g., via a call thedifference reduction subroutine 400.

The processing of the third set of image values, e.g., a second set ofchrominance values (C_(B)) corresponding to the color blue, is similarto the processing of the first set of chrominance values except with theCb values being used instead of the Cr values. The processing of thesecond set of chrominance values proceeds from step 104 via connectingnode 121 to step 122 and includes steps 122, 124, 126 and 128 which arethe same or similar to previously described steps 114, 116, 118 and 120,respectively, but with different chrominance values being processed andpossibly different thresholds and/or amounts of difference reductionbegin performed. Given the similarity in the processing of the two setsof chrominance values the processing with regard to the second set ofchrominance values will not be discussed further.

FIG. 1C shows the steps associated with producing an output set of leftand right eye image data reflecting the result of any differencereduction operations performed to the first, second and third sets ofimage data, e.g., luminance, Cr and C_(B) data. In step 132 a set ofdata representating processed left and right eye images corresponding tothe frame being processed is generated. In the processed sets of leftand right eye image data to be output, modified image values which werethe result of altering input image values are included in the outputdata set for values which were altered. Image values which were notaltered are included in their unaltered form in the set of processedleft and right eye image data generated by the precoding method. Thus,an output frame corresponding to an input frame will normally includethe same number of pixel, e.g., image, values as the input frame butwith some of the values having been altered from the input values aspart of a difference reduction operation. It should be appreciated thatwhen difference reduction operations are not performed on the second andthird sets of image values, these sets will be output unchanged, e.g.,simply passed through the precoder.

Processed images representing a frame are supplied in step 134 to anencoder which then encodes the left and right eye images. A stereoscopicimage encoder which uses differential and/or predictive coding to codethe left and right eye images of a frame may be used as the encoder. Instep 135 the encoder is operated to encode the processed left and righteye image data supplied to it. Then in step 136 the encoded image datagenerated by the stereoscopic encoder is stored, transmitted and/ordecoded and displayed. A system administrator overseeing the encodingprocess may, and in some embodiments does, alter one or more thresholdsand/or the amount of difference reduction performed by the precoderbased on the perceived quality of images generated from the encodedimage data as a program or event being photographed and encoded is stillon-going.

The process shown in FIG. 1 is an ongoing process allowing a sequence offrames to be processed, encoded and transmitted. This is reflected instep 138 where a check is made as to whether there are any more frames,e.g., from the stereoscopic camera, to be processed. If the answer tostep 138 is yes, e.g., because the event being photographed is stillongoing, operation precedes via connecting node D 140 back to step 104wherein the next frame to be processed is received. However if thedetermination in step 138 indicates that there are no more frames toprocess, operation proceeds from step 138 to stop step 142.

Applied to at least some test content, the difference reductionoperations and related processing shown in FIG. 1, dramatically reducesinput differentials that would otherwise cause deleterious results inthe final decoded image that may have occurred absent the use of suchdifference reduction processing.

Benefits of processing input frames in the manner shown in FIG. 1 mayinclude overall encoder efficiency improvements allowing forstereoscopic frames to be encoded at very low-bitrates. This is due tothe fact that feature-matching is often highly-critical to successfulredundancy elimination at low target bit rates (e.g. 2 Mbps) and thedifference reduction performed in the described embodiment enhancessimilarity and thus feature matching between frames.

The described method may also improve the visual quality of poor, e.g.,low quality input content which may be expected in real time encodingsituations, e.g., live sporting events and/or other situations wherestudio quality control over the image capture process is not possible oris lacking.

In at least some embodiments the precoding and encoding methods are usedas part of a diagnostic process as part of a stereoscopic contentproduction process. In at least one such embodiment the precoder andencoding process described herein is used to produce a real-time measureof production stereo quality that is then used diagnostically in theproduction environment to facilitate camera set up, lighting changesand/or other changes without having to wait for normal post productionprocessing of the captured images to determine the effect of variouschanges or settings on the production process.

FIG. 1 illustrates image processing in accordance with the inventionthat can, and in some embodiments is, performed by the system shown inFIG. 5. The processing can also be performed by the apparatus 804 shownin FIG. 8.

FIG. 3 illustrates an exemplary subroutine 300 for determining if adisparity value is in a difference range. Subroutine 300 may be, and insome embodiments of the present invention is, used for implementingsteps 108, 116, and 124 of method 100 of FIG. 1. For explanatorypurposes, subroutine 300 will be explained as being implemented on theexemplary precoder system 500 of FIG. 5. The subroutine 300 starts atstart step 302 with subroutine 300 being executed on processor 510 fromwhich execution proceeds to step 306.

In step 306, first and second difference range thresholds to be used inthe determination 304 are received for example from a user, e.g.,administrator, via input device 504. In this exemplary embodiment, thefirst difference range threshold being lower in value than the seconddifference range threshold. In some embodiments the first, lowerthreshold is 20% of the maximum possible disparity between the left eyeimage and the right eye image (e.g., when implementing subroutine 108,20% of the maximum possible luminance difference between the left eyeimage and the right eye image). In at least some embodiments, thesecond, upper threshold is 70% of the maximum possible disparity betweenthe left eye image and the right eye image (e.g., when implementingsubroutine 108, 70% of the maximum possible luminance disparity betweenthe left eye image and the right eye image).

In some embodiments, the first and second difference range thresholdsmay be, and are, received from a user during initialization of theprecoder system 502 with the thresholds being stored in memory 512. Inat least some of such embodiments, the processor 510 receives thethresholds from memory 512.

In step 306 the disparity value 305 which the subroutine 300 willdetermine whether it is in the difference range defined by the first andsecond difference range thresholds is also received. In the case of thedisparity value being of type, luminance the disparity value may be thefirst overall disparity value (ODVY_(DIFF)) generated in step 106 ofmethod 100 from the first left and right eye image values received instep 104. In the case of the disparity value being of type chrominance,the disparity value may be the second overall disparity value(ODVCR_(DIFF)) generated in step 114 from the second left and right eyeimage values received in step 104 or the disparity value may be thethird overall disparity value (ODVCB_(DIFF)) generated in step 122 fromthe third left and right eye image values received in step 104. In someembodiments, the luminance and chrominance disparity values of steps106, 114, and 122 are generated using exemplary subroutine 200 with eachof the luminance and chrominance disparity values being outputted instep 218 of the subroutine as disparity value 220 which may be, and insome embodiments is stored in memory 512 with information associatingthe disparity value with it type and its associated image values.

Processing proceeds from receiving step 306 to comparison step 308. Instep comparison step 308 the disparity value 305, e.g., first overalldisparity value (ODVY_(DIFF)) received in step 306 is compared to thefirst, lower difference range threshold, e.g., 20% of the maximumpossible luminance disparity between the left eye image and the righteye image. If the disparity value is above first, lower differencethreshold than processing proceeds to step 310. Other processingproceeds to step determination step 312. In decision step 310, thedisparity value 305 received in step 306 is compared to the seconddifference range threshold received in step 306. If the disparity value305 is below the second difference range threshold than processing todetermination step 314. Otherwise processing proceeds to determinationstep 312.

In determination step 312, it is determined that the disparity value 305received in step 306 is not within the difference range specified by thefirst and second difference range thresholds 304 received in step 306.Processing then proceeds to return step 316 where the disparity valuedetermination result that the disparity value is not in the differencerange is returned to the routine and/or method that invoked subroutine300. Processing then proceeds to stop step 318 where processing inconnection with subroutine 300 concludes but processing in the systemfor example, processing in connection with the returned determinationresult continues. The determination result may be, and in someembodiments is, stored in memory 512 for potential later use beforeprocessing concludes in connection with subroutine 300 at step 318.

In determination step 314, it is determined that the disparity value 305received in step 306 is within the difference range specified by thefirst and second difference range thresholds 304 received in step 306.Processing then proceeds to return step 316 where the disparity valuedetermination result that the disparity value is in the difference rangeis returned to the routine and/or method that invoked subroutine 300.Processing then proceeds to stop step 318 where as discussed aboveprocessing in connection with subroutine 300 concludes but processing inthe system for example, processing in connection with the returneddetermination result continues. The determination result may be, and insome embodiments is, stored in memory 512 for potential later use beforeprocessing concludes in connection with subroutine 300 at step 318. Byway of example, if the received disparity value is a luminance disparityvalue (ODVY_(DIFF)) that is 60% of the maximum potential luminancedisparity between the left and right eye images and the received firstlower difference threshold was 20% of the maximum potential luminancedisparity between the left and right eye images and the received seconddifference threshold value was 70% of the maximum potential luminancedisparity between the left and right eye images than routine 300 woulddetermine and return the result that the received disparity value waswithin the difference range. If however, the disparity difference valuereceived was for example 10% of the maximum potential luminancedisparity between the left and right eye images then routine 300 woulddetermine and return the result that the disparity value of 10% is notin the difference range. While the above example has been explainedmostly in connection with luminance disparity values it is alsoapplicable to chrominance disparity values e.g., ODVCR_(DIFF) andODVCB_(DIFF).

FIG. 4 illustrates an exemplary subroutine 400 for performing adifference reduction operation on one or more regions of either the leftand/or right eye images of a frame. Subroutine 400 may be, and in someembodiments of the present invention is, used for implementing steps112, 120, and 128 of method 100 of FIG. 1. For explanatory purposes,subroutine 400 will be explained as being implemented on the exemplaryprecoder system 500 of FIG. 5. The subroutine 400 starts at start step402 with subroutine 400 being executed on processor 510 from whichexecution proceeds to step 404. In step 404, portions of the left andright eye images are assigned to supported types of regions, e.g.,foreground, background and/or middle ground region to thereby generate aregion map based on characteristics of the image portions beingassigned. For example the region maps may be depth maps wherein thecharacteristics of the image are the depth (foreground, background,middle ground). A discussion of stereo correspondence and depth mappingcan be found at the following Internet website:http://blog.martinperis.com/2011/08/opencv-stereo-matching.html. In atleast some embodiments all portions of the left and right eye images areassigned to a single region which encompasses the entire image in such acase the entire left and right eye images are processed as a singleregion. However, in most embodiments multiple regions, often 3 or more,are supported with image values being assigned to various regions, e.g.,with each image value being assigned to one of the regions. Processingproceeds to step 406 wherein the image region to be processed is set tothe first of the supported types of regions to which an image portionhas been assigned, e.g., foreground region. Processing proceeds to step408.

In step 408, an overall region disparity value for the region beingprocessed based on the disparity between the image region in the leftand right eye images is generated. For example, if the foreground regionis being processed and the disparity value is being generated fromluminance image values, a disparity value, e.g., luminance disparityvalue, is generated for the pixel luminance values assigned to theforeground regions of the right and left eye images. Step 408 may beimplemented by a call to the subroutine 200 of FIG. 2. Processingproceeds from step 408 to decision step 410.

In decision step 410, the overall region disparity value is evaluated todetermine whether it is within the first difference range. The firstdifference range may be the same as the difference range discussed inconnection with subroutine 300 of FIG. 3. If the overall regiondisparity value is in the first difference range then processingproceeds to step 412. Otherwise processing proceeds to step 422.

In step 422, it is determined that the system will not perform adifference reduction operation on the current image region beingprocessed, e.g., the foreground region because the overall disparityvalue is not within the first difference range.

In step 412, a difference reduction operation is performed on the valuescorresponding to the region included in one or both of the left andright eye images. In some embodiments, step 412 includes the optionalstep 414 and/or optional step 416. In optional step 414 a histogrammatching operation is performed as part of the difference reductionoperation of step 412. Histogram matching operations for differencereduction in image processing is known in the art. For example, adiscussion of histogram matching can be found on the Internet at thefollowing website: http://en.wikipedia.org/wiki/Histogram_matching. Inoptional step 416 an earth mover's distance operation is performed aspart of the difference reduction operation of step 412. Earth mover'sdistance operation for difference reduction is also known in the art.For example, a discussion of the earth mover's distance operation isdiscussed at the following website:http://en.wikipedia.org/wiki/Earth_Mover %27s_Distance. While both thehistogram matching operation and earth mover's distance operation may beused in the difference reduction operation, typically only one of thetwo operations is performed as part of the difference reductionoperation.

FIG. 7 is a diagram 700 which shows how the left and right eye luminancehistograms shown in FIG. 6 can be modified to reduce the differencebetween the histograms. In the FIGS. 6 and 7 examples the left and righteye images are of a resolution (1280×720) which results in 921,600luminance values for each of the left and right eye images.

The difference histogram for the exemplary left and right eye images isdetermined from a per bin comparison of the counts in the left and righteye histograms. Such a comparison gives us the number of pixels whichare different. There are 38.84% of the pixels between the left and righteye images which are different (357,960/921,600 total pixels). 38.84% isa straight forward percentage of pixels that are not in the samebuckets. When the count of the number of pixels that are in a differentbin are weighted by the bin's corresponding luminance value this gives aweighted difference of 60/146,935. The maximum weighted difference is,in this example 921,600*255==235,000,800. Which gives us a weighteddifference of 25.59%. Since the difference of 25.59% is in the 20% to70% range used to trigger a difference reduction operation in someembodiments, a luminance difference reduction operation will betriggered in the FIG. 7 example.

Diagrams 702 and 706 are illustrations showing the actual histograms forthe left and right eye images, respectively, using dark solid lines toshown the actual histogram and a lighter dotted line to show the shapeof the histogram if the values of the two histograms were combined andthere was no difference between the histograms as shown in diagram 606.

Arrows are used in diagrams 702 and 706 to show where luminance imagevalues should be increased and decreased to reduce the differencebetween the left and right eye luminance histograms and thus the overallluminance difference between the left and right eye images. Diagrams704, 708 show the histograms of processed left and right eye imagesafter the difference reduction operation is performed. Note that whilethe histograms shown in diagrams 704 and 708 are not identical they aremuch closer in overall shape than the histograms shown in diagrams 702,706 which corresponded to the input image being processed.

While FIG. 7 is an example of processing and reducing the luminancedifference between left and right eye images of a frame where theprocessing is performed on a region which is the size of the entireframe, the same kind of processing can be performed on other sizeregions, e.g., foreground, middle and background regions when portionsof images corresponding to a frame are assigned to such regions andwhere the implemented embodiment performs difference reductions on a perregion basis with the images including multiple regions. Also note whileexplained in the context of luminance values the difference reductionoperation would be the same or similar when applied to chrominancereduction operations. However the maxim amount of change in image valuesin the case of chrominance reduction may be, and sometimes is, less thanthe amount of difference reduction applied to luminance values. The useof different amounts of difference reduction to color and luminance datacan avoid undesirable color changes which may be perceivable to an enduser during playback.

Processing proceeds from steps 412 and 422 to decision step 418. Indecision step 418, a determination is made as to whether each imageregion to which at least one image portion has been assigned has beenprocessed. If all regions have been processed then processing proceedsto return step 424 wherein processing in connection with subroutine 400is concluded as processing returns the routine or method that invokedsubroutine 400. If all regions have not been processed then processingproceeds from step 418 to step 420 where the processor 510 sets to theregion to be processed to the next region and processing proceeds backto step 408 wherein processing continues as previously described butthis time in connection with the next region.

FIG. 5 illustrates a precoder and/or encoder system 500 implemented inaccordance with some exemplary embodiments of the present invention. Thesystem 500 includes a display 502, input device 504, input/output (I/O)interface 506, a camera 509, e.g., a stereoscopic camera, a processor510, network interface 530 and a memory 512 which are coupled togetherby bus 508. The memory 512 includes various modules, e.g., routines,which when executed by the processor 510 control the precoder and/orencoder system 500 to implement the precoding and/or encoding methodswhich have been described. In some embodiments the system 500 implementsthe routines and subroutines 100, 200, 300 and 400.

The memory 512 includes routines 514 and various modules including animage acquisition module 515, a precoder 516, an encoder 518, and adecoder 520. The memory 512 further includes encoded data 522 which isan output of the encoder 518, decoded data 524 which is an output of thedecoder 520. In some embodiments the memory further includes an assemblyof modules 528 which may be, and in some embodiments is implemented aspart of the precoder 516.

Routines 514 include communications routines and/or control routines.

The processor 510, e.g., a CPU, executes routines 514 and one or moremodules to control the system 500 to operate in accordance with theinvention. To control the system 500, the processor 510 usesinformation, various modules and/or routines including instructionsstored in memory 512.

The image acquisition module 515 is configured for receiving and storingcontent to be encoded. The content, e.g., left and right eye image pairsrepresenting frames of a stereoscopic image sequence, are stored in thememory as stored image data 526. The precoder 516 is configured toprocess a pair of images including a left eye image and a right eyeimage, each of said left and right eye images including image values ofa first type, e.g., luminance values or chrominance values, to performvarious precoding operations in accordance with the embodiments of theinvention. Thus in some embodiments various steps discussed with regardto routines 100 through 400 are implemented by the precoder 516.

In some embodiments the precoder 516 is configured to generate a firstoverall disparity value indicating a first overall level of differencebetween values of a first type in a left eye image and values of thefirst type in a right eye image, e.g., in the received left and righteye image pair. In some embodiments the values of the first type are oneof luminance values, chrominance values corresponding to a first color(C_(R)), or chrominance values corresponding to a second color (C_(B)).In some embodiments the precoder 516 is further configured to determineif the first overall disparity value is in a first difference range usedto control performing of a first difference reduction operation, thefirst difference reduction operation being an operation on values of thefirst type included in at least one of the left eye image or the righteye image to reduce the overall difference between values of the firsttype in the left eye image and values of the first type in the right eyeimage, and perform the first difference reduction operation to reducethe first overall level of difference between values of the first typein the left eye image and values of the first type in the right eyeimage when the first overall disparity value is determined to be in thefirst difference range.

Encoder 518 encodes frames with left and right eye images which havebeen subject to the precoding operations. The decoder module 520 can beused to decode an encoded stream and to supply it to the display 502.Thus the decoder 520 decodes images which are then output for display onthe display 502 in some embodiments. The operator may control, via inputdevice 504, one or more encoding parameters via input device 504 and/orselect which of the encoded bitstreams is to be decoded and displayed.The various components of the system 500 are coupled together via bus508 which allows for data to be communicated between the components ofthe system 500.

In some embodiments, the video content, e.g., left and right eye imagesof a 3D image sequence, inputted by the camera 509, e.g., a stereoscopiccamera, are stored in the memory 512 for example in stored image data526. The precoder module 516 processes the left and right eye image datato reduce differences between the left and right eye images. This hasthe advantage of increasing encoding efficiency when encoding the leftand right eye images when using a differential encoder and/or anotherencoder which encodes one of the left and right eye images based on thecontent of the other one of the left and right eye images. In additionto providing improved encoding efficiency in at least some embodimentsthe difference reduction techniques improve perceived decoded imagequality by reducing differences in the left and right eye images whichmay be due to reflections and/or saturation of the input sensor of oneof the eye images and/or other image capture related issues. The encodermodule 518 encodes the precoded image data to generate encoded data. Theprocessing steps shown in FIGS. 1-4 may be, and in some embodiments are,performed by the precoder module or a module which processes left andright eye images. In at least some embodiments of the present invention,some or all of the processing occurring in the precoder module isincorporated into the encoding module. The encoded content can be, andin some embodiments is, streamed to one or multiple different devicesvia the network interface 530. The video content to be precoded and/orencoded can also be streamed through network interface 530. The decodermodule 520 can be, and in some embodiments is, used to decode an encodedstream and to supply it to the display 502. In this manner an operatorof system 500 can view the result of the precoding and encodingprocesses. The operator, e.g., administrator, may control one or moreprecoding and encoding parameters via input device 504 and/or selectwhich of the precoded and encoded bit streams is to be decoded anddisplayed via input device 504. The precoding parameters inputted by theoperator may be stored in memory 512 for use during processing.

FIG. 8 illustrates an exemplary system 800 implemented in accordancewith an exemplary embodiment of the invention. The exemplary system 800includes camera 802, a precoder and/or encoder system 804 and a playbackdevice 830 in accordance with an exemplary embodiment.

The camera 802 in some embodiments is a stereoscopic camera and providesinputs, e.g., left and right eye images, to the precoder and/or encodersystem 804. It should be appreciated that the precoder and/or encodersystem 804 performs the same functions and operation as discussed abovewith regard to the system 500 however in the FIG. 8 embodiment thevarious elements of the precoder and/or encoder system 804 areimplemented in hardware, e.g., as individual circuits. The precoderand/or encoder system 804 includes a precoder 810, an encoder 812, adecoder 814, a display 816, a processor 818, a storage device 820, aninput device 821 and a network interface 822. Various elements of theprecoder and/or encoder system 804 are coupled together via bus 809which allows for data to be communicated between the elements.

The received input from the camera, e.g., left and right eye image pairsof a stereoscopic e.g., 3D, image sequence, may be stored in the memory820 and is used for performing further processing in accordance with theinvention. Thus the left and right eye images are supplied from thecamera 802 to the precoder 810. The precoder 810 is configured toprocess received image pairs, e.g., the left eye image and a right eyeimage, to perform various precoding operations in accordance with theembodiments of the invention. In various embodiments each of said leftand right eye images include image values of a first type, e.g.,luminance values or chrominance values. In some embodiments the precoder810 is configured to implement/perform the precoding operationsdiscussed in the steps of routines 100-400. As will be discussed, insome embodiments the precoder 810 includes one or more modules forperforming various precoding operations discussed above. Depending onthe embodiment, such modules may be implemented completely in hardware,software or as a combination of software and hardware.

The encoder 812 encodes frames with left and right eye images which havebeen subject to the precoding operations. The decoder 814 can be used todecode an encoded stream and to supply it to the display 816. Thus insome embodiments the decoder 814 decodes images which are then outputfor display on the display 816.

The processor 818, e.g., a CPU, controls the operation of the precoderand/or encoder system 804 in accordance with the invention. In variousembodiments the processor executes routines to control the precoderand/or encoder system 804 to operate in accordance with the invention.To control the precoder and/or encoder system 804, the processor 818uses information and/or routines including instructions included in thememory 820. An operator of the precoder and/or encoder system 804 maycontrol the system via various control parameters an/or thresholdsetting which may be input and set via input device 821.

In some embodiments, encoded image data generated after the precodingprocessing operations have been performed on received stereoscopicimages, is communicated, e.g., over the internet or anothercommunications network, via the network interface 822 to a contentplayback device such as the playback device 830.

The exemplary content playback device 830 includes an input/output (I/O)interface 832, a decoder 834, a display 836, a processor 838 and memory840. The playback device receives content, e.g., encoded stereoscopicimages, via the I/O interface 832. The received content is supplied tothe decoder 834 for processing. The decoder 834 recovers the image databy performing decoding operations in accordance with the invention. Thedecoded image data is stored in the memory 840. As per a user's desirethe decoded image data may be, and in some embodiments is, supplied tothe display 836 for display.

FIG. 9 is an assembly of modules 900 which can, and in some embodimentsis, used in the precoder and/or encoder system 500 illustrated in FIG. 5and/or the precoder and/or encoder system 804 illustrated in FIG. 8.Assembly of modules 900 can be implemented in hardware within theprecoder 810 of FIG. 8 and/or precoder 516 of FIG. 5. The modules in theassembly 900 can, and in some embodiments are, implemented fully inhardware within the precoder 810, e.g., as individual circuits. In someother embodiments some of the modules are implemented, e.g., ascircuits, within the precoder 810 with other modules being implemented,e.g., as circuits, external to and coupled to the precoder.

Alternatively, rather than being implemented as circuits, all or some ofthe modules may be implemented in software and stored in the memory 512,e.g., as assembly of modules 528, as shown in FIG. 5 and/or in thememory 820 with the modules controlling operation of the precoder 516and/or precoder 810 to implement the functions corresponding to themodules when the modules are executed by a processor, e.g., processor510 or processor 818. In still other embodiments, various modules areimplemented as a combination of hardware and software.

While shown in the FIG. 5 embodiment as a single processor, e.g.,computer, it should be appreciated that the processor 510 may beimplemented as one or more processors, e.g., computers.

When implemented in software the modules include code, which whenexecuted by the processor 510, configure the processor 510 to implementthe function corresponding to the module. In embodiments where theassembly of modules 900 is stored in the memory 512, the memory 512 is acomputer readable medium comprising code, e.g., individual code for eachmodule, for causing at least one computer, e.g., processor 512, toimplement the functions to which the modules correspond.

Thus completely hardware based or completely software based modules maybe used. However, it should be appreciated that any combination ofsoftware and hardware, e.g., circuit implemented modules may be used toimplement the functions. As should be appreciated, the modulesillustrated in FIG. 9 control and/or configure the precoder and/orencoder system 500 or 804, or elements therein such as the processors510 and/or 818, to perform the functions of the corresponding stepsillustrated in the flow charts shown in FIGS. 1-4.

As illustrated in FIG. 9, the assembly of modules 900 includes adisparity value generation module 902 configured to generate a firstoverall disparity value indicating a first overall level of differencebetween values of a first type in a left eye image and values of thefirst type in a right eye image. The left and right eye images are,e.g., stereoscopic images, received from a camera, e.g., camera 802. Insome embodiments the values of the first type are luminance values. Insome embodiments the values of the first type are chrominance valuescorresponding to a first color (C_(R)), or chrominance valuescorresponding to a second color (C_(R)).

The assembly of modules 900 further includes a determination module 904configured to determine if the first overall disparity value is in afirst difference range used to control performing of a first differencereduction operation, the first difference reduction operation being anoperation on values of the first type included in at least one of theleft eye image or the right eye image to reduce the overall differencebetween values of the first type in the left eye image and values of thefirst type in the right eye image. In various embodiments thedetermination module 904 includes a comparison module 906 configured tocompare the first overall disparity value to a first thresholdcorresponding to a lower boundary of said the difference range, and tocompare the first overall disparity value to a second thresholdcorresponding to an upper boundary of the first difference range. Insome embodiments the determination module 904 is configured to determinethat the overall disparity value is in the first difference range when aresult of a comparison performed by the comparison module 906 indicatesthat said first overall disparity value is above said first thresholdand below said second threshold.

The assembly of modules 900 further includes a difference reductionoperation module 908 configured to perform the first differencereduction operation to reduce the first overall level of differencebetween values of the first type in the left eye image and values of thefirst type in the right eye image when the first overall disparity valueis determined to be in the first difference range. For example, in someembodiments a difference reduction operation is triggered when thedetermination module 904 determines that there is, e.g., more than 25%difference and less than 70% overall luminance difference between leftand right eye images. In various embodiments the difference reductionoperation module 908 includes a modification module 910 configured tomodify values of the first type in a first one of the left and right eyeimages to reduce an overall disparity between the values of the firsttype in a first region of the first one of said left and right eyeimages and the values of the first type included in a second one of saidleft and right eye images, said second one of said left and right eyeimages being different from said first one of said left and right eyeimages. In some embodiments the difference reduction operation module908 further includes a depth map generation module 912 configured togenerate a depth map for each of the left and right eye images, and anidentification module 914 configured to identify different regions inthe left and right eye images based on depths to which the regions aredetermined to correspond, the depths include at least a first imagedepth and a second image depth.

In some embodiments the first image depth corresponds to a firstdistance range from a camera to objects in an image region identified ascorresponding to the first image depth and a second image depthcorresponds to a second distance range from the camera to objects in animage region identified as corresponding to the second image depth, thefirst and second image depths being different. In some embodiments thefirst and second image depths correspond to foreground and backgroundimage regions, respectively.

Some embodiments are directed a non-transitory computer readable mediumembodying a set of software instructions, e.g., computer executableinstructions, for controlling a computer or other device to encode andcompresses stereoscopic video. Other embodiments are embodiments aredirected a computer readable medium embodying a set of softwareinstructions, e.g., computer executable instructions, for controlling acomputer or other device to decode and decompresses video on the playerend. While encoding and compression are mentioned as possible separateoperations, it should be appreciated that encoding may be used toperform compression and thus encoding may, in some include compression.Similarly, decoding may involve decompression.

The techniques of various embodiments may be implemented using software,hardware and/or a combination of software and hardware. Variousembodiments are directed to apparatus, e.g., a video data processingsystem. Various embodiments are also directed to methods, e.g., a methodof processing video data. Various embodiments are also directed tomachine, e.g., computer, readable medium, e.g., ROM, RAM, CDs, harddiscs, etc., which include machine readable instructions for controllinga machine to implement one or more steps of a method.

Various features of the present invention are implemented using modules.Such modules may, and in some embodiments are, implemented as softwaremodules. In other embodiments the modules are implemented in hardware.In still other embodiments the modules are implemented using acombination of software and hardware. A wide variety of embodiments arecontemplated including some embodiments where different modules areimplemented differently, e.g., some in hardware, some in software, andsome using a combination of hardware and software. It should also benoted that routines and/or subroutines, or some of the steps performedby such routines, may be implemented in dedicated hardware as opposed tosoftware executed on a general purpose processor. Such embodimentsremain within the scope of the present invention. Many of the abovedescribed methods or method steps can be implemented using machineexecutable instructions, such as software, included in a machinereadable medium such as a memory device, e.g., RAM, floppy disk, etc. tocontrol a machine, e.g., general purpose computer with or withoutadditional hardware, to implement all or portions of the above describedmethods. Accordingly, among other things, the present invention isdirected to a machine-readable medium including machine executableinstructions for causing a machine, e.g., processor and associatedhardware, to perform one or more of the steps of the above-describedmethod(s).

Numerous additional variations on the methods and apparatus of thevarious embodiments described above will be apparent to those skilled inthe art in view of the above description. Such variations are to beconsidered within the scope.

What is claimed is:
 1. A method of processing a pair of images includinga left eye image and a right eye image, each of said left and right eyeimages including image values of a first type, the method comprising:generating a first overall disparity value indicating a first overalllevel of difference between values of said first type in said left eyeimage and values of said first type in said right eye image; determiningif said first overall disparity value is in a first difference rangeused to control performing of a first difference reduction operation,said first difference reduction operation being an operation on valuesof said first type included in at least one of said left eye image orsaid right eye image to reduce the overall difference between values ofsaid first type in said left eye image and values of said first type insaid right eye image; and performing said first difference reductionoperation to reduce the first overall level of difference between valuesof said first type in said left eye image and values of said first typein said right eye image when said first overall disparity value isdetermined to be in said first difference range.
 2. The method of claim1 wherein said values of said first type are one of luminance values,chrominance values corresponding to a first color, or chrominance valuescorresponding to a second color.
 3. The method of claim 1, whereindetermining if said first overall disparity value is in the firstdifference range used to control performing of the first differencereduction operation includes: comparing said first overall disparityvalue to a first threshold corresponding to a lower boundary of saidfirst difference range.
 4. The method of claim 3, wherein determining ifsaid first overall disparity value is in the first difference range usedto control performing of the first difference reduction operationfurther includes: comparing said first overall disparity value to asecond threshold corresponding to an upper boundary of said firstdifference range.
 5. The method of claim 4, wherein determining if saidfirst overall disparity value is in the first difference range used tocontrol performing of the first difference reduction operation furtherincludes: determining that said overall disparity value is in the firstdifference range when said comparing steps indicate that said firstoverall disparity value is above said first threshold and below saidsecond threshold.
 6. The method of claim 1, wherein performing saidfirst difference reduction operation includes: modifying values of thefirst type in a first one of said left and right eye images to reduce anoverall disparity between the values of the first type in a first regionof the first one of said left and right eye images and the values of thefirst type included in a second one of said left and right eye images,said second one of said left and right eye images being different fromsaid first one of said left and right eye images.
 7. The method of claim6, wherein performing said first difference reduction operation furtherincludes: generating a depth map for each of said left and right eyeimages; and identifying different regions in said left and right eyeimages based on depths to which the regions are determined tocorrespond, said depths including at least a first image depth and asecond image depth.
 8. The method of claim 7, wherein said first imagedepth corresponds to a first distance range from a camera to objects inan image region identified as corresponding to said first image depthand a second image depth corresponds to a second distance range from thecamera to objects in an image region identified as corresponding to saidsecond image depth, said first and second image depths being different.9. The method of claim 7, wherein said first and second image depthscorrespond to foreground and background image regions, respectively. 10.An apparatus for processing a pair of images including a left eye imageand a right eye image, each of said left and right eye images includingimage values of a first type, comprising: a disparity value generationmodule configured to generate a first overall disparity value indicatinga first overall level of difference between values of said first type insaid left eye image and values of said first type in said right eyeimage; a determination module configured to determine if said firstoverall disparity value is in a first difference range used to controlperforming of a first difference reduction operation, said firstdifference reduction operation being an operation on values of saidfirst type included in at least one of said left eye image or said righteye image to reduce the overall difference between values of said firsttype in said left eye image and values of said first type in said righteye image; and a difference reduction operation module configured toperform said first difference reduction operation to reduce the firstoverall level of difference between values of said first type in saidleft eye image and values of said first type in said right eye imagewhen said first overall disparity value is determined to be in saidfirst difference range.
 11. The apparatus of claim 10, wherein saidvalues of said first type are one of luminance values, chrominancevalues corresponding to a first color, or chrominance valuescorresponding to a second color.
 12. The apparatus of claim 10, whereinsaid determination module includes: a comparison module configured tocompare said first overall disparity value to a first thresholdcorresponding to a lower boundary of said first difference range. 13.The apparatus of claim 12, wherein said comparison module is furtherconfigured to compare said first overall disparity value to a secondthreshold corresponding to an upper boundary of said first differencerange.
 14. The apparatus of claim 13, wherein said determination moduleis further configured to determine that said overall disparity value isin the first difference range, when a result of a comparison performedby said comparison module indicates that said first overall disparityvalue is above said first threshold and below said second threshold. 15.The apparatus of claim 10, wherein said difference reduction operationmodule includes: a modification module configured to modify values ofthe first type in a first one of said left and right eye images toreduce an overall disparity between the values of the first type in afirst region of the first one of said left and right eye images and thevalues of the first type included in a second one of said left and righteye images, said second one of said left and right eye images beingdifferent from said first one of said left and right eye images.
 16. Theapparatus of claim 15, wherein said difference reduction operationmodule further includes: a depth map generation module configured togenerate a depth map for each of said left and right eye images; and anidentification module configured to identify different regions in saidleft and right eye images based on depths to which the regions aredetermined to correspond, said depths including at least a first imagedepth and a second image depth.
 17. The apparatus of claim 16, whereinsaid first image depth corresponds to a first distance range from acamera to objects in an image region identified as corresponding to saidfirst image depth and a second image depth corresponds to a seconddistance range from the camera to objects in an image region identifiedas corresponding to said second image depth, said first and second imagedepths being different.
 18. The apparatus of claim 16, wherein saidfirst and second image depths correspond to foreground and backgroundimage regions, respectively.
 19. A non-transitory computer readablemedium for use in an apparatus for processing a pair of images includinga left eye image and a right eye image, each of said left and right eyeimages including image values of a first type, the non-transitorycomputer readable medium comprising: instructions which when executed bya processor cause said processor to generate a first overall disparityvalue indicating a first overall level of difference between values ofsaid first type in said left eye image and values of said first type insaid right eye image; instructions which when executed by said processorcause said processor to determine if said first overall disparity valueis in a first difference range used to control performing of a firstdifference reduction operation, said first difference reductionoperation being an operation on values of said first type included in atleast one of said left eye image or said right eye image to reduce theoverall difference between values of said first type in said left eyeimage and values of said first type in said right eye image; andinstructions which when executed by said processor cause said processorto perform the first difference reduction operation to reduce the firstoverall level of difference between values of said first type in saidleft eye image and values of said first type in said right eye imagewhen said first overall disparity value is determined to be in saidfirst difference range.
 20. The non-transitory computer readable mediumof claim 19, further comprising: instructions which when executed bysaid processor cause said processor to compare said first overalldisparity value to a first threshold corresponding to a lower boundaryof said first difference range; and instructions which when executed bysaid processor cause said processor to compare said first overalldisparity value to a second threshold corresponding to an upper boundaryof said first difference range.